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Abstract 

When one is presented with an item or a face, one can sometimes 
have a sense of recognition without being able to recall where or when 
one has encountered it before. This sense of recognition is known 
as familiarity. Following previous computational models of familiar- 
ity memory we investigate the dynamical properties of familiarity dis- 
crimination, and contrast two different familiarity discriminators: one 
based on the energy of the neural network, and the other based on 
the time derivative of the energy. We show how the familiarity sig- 
nal decays after a stimulus is presented, and examine the robustness 
of the familiarity discriminator in the presence of random fluctuations 
in neural activity. For both discriminators we establish, via a com- 
bined method of signal-to-noise ratio and mean field analysis, how the 
maximum number of successfully discriminated stimuli depends on the 
noise level. 

Keywords: Recognition memory, Familiarity discrimination, Stor- 
age capacity. 

Abbreviations: SNR, Signal-to-Noise Ratio; FamE, Familiarity dis- 
crimination based on Energy; FamS, Familiarity discrimination based 
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on Slope. 



Introduction 



Recognition memory is supported by at least two different types of re- 
trieval processes: recollection and familiarity. While recollection requires 
detailed information about an experienced event, familiarity just distin- 
guishes whether or not the stimulus was previously encountered. A well 
known example is the encounter with a colleague during a conference: one 
might recognize the person, but fail to remember the time and place of an 
earlier meeting. 

Familiarity memory is thought to h ave a very large capacity. In the 



19731 ) tested the capac- 



early 1970s, Standing and collaborators (|Standingl . 
ity in humans by presenting participants with a large number (10,000) of 
images. Surprisingly, after this one-shot learning, participants were able to 
successfully recognize most of the previously seen pictures, suggesting that 
the capacity of recognition memory for pictures is very large indeed. 

Experimental psychologists have formulated dual-process theories which 
characterize the precise contrib ution of familiari ty and recollection to recog- 



nition memory, for a review see (lYonelinasl . 



2002 ). Anatomically, researchers 



have proposed that different brain areas are engaged during recollection 
and familiarity processing. Single item familiarity is believed to be pro- 



cessed in the perirhinal cortex, whe reas recollection is b e 



ieved to engage 



2001 



) . Further- 



the hippocampus, for a review see ([Brown and Aggletonl . 
more, ele ctro-physiological studies using single cell r ecordings in monkeys 



and rats (Brown et al 



19871 : 



Brown and Xiang 



19981 ) report that about 30 



percent of neurons in the perirhinal cortex show increased activity after pre- 
senting new compared to old stimuli. These neurons have been interpreted 



as novelty detectors. However, this association between the memory pro- 



cesses and brain area is stih unclear anc 



the stimulus (jAggleton and Brown 



20051 : 



seems to depend on t 



le nature of 



Rugg and Yonelinas 



2003) 



Recent ne uroimaging studies u s ing, f or example, event-related poten- 

2OO3I ) ■ have revealed that familiarity 



tials (ERPs) (jRugg and Yonelinas 



and recollection have distinct temporal characteristics. Familiarity is linked 
to a frontal ERP modulation that occurs around 300-500ms post-stimulus 



presentation, whilst recollection is thought to evoke a parietal ER. 



ulation around 50 0-800ms after stimulus presentation (jRugg et al. 



Greve et al. 



mod- 



19981 : 



20071 ). Therefore, the speed of processing of familiarity dis- 
crimination is more rapid than recollection. Behavioral experiments pro- 
vide further evidence for this: if only very limited time is available for a 



recognition decision, p articipants re 



recollection processes (jPosheri . 



19M)- 



y primarily on familiarity as opposed to 



In the field of computational neuroscience, modeling of recollection via 



attractor neural networks has a 



networks (jHopfield 



1982 



Amit 



ong 



history using auto-associator Hopfield 



been studied much more recently 



19891). Familiarity disc r imina tion has only 



Bogacz and Brown . 



20031). Computa- 



tional models of familiarity discrimination have a much higher storage ca- 
pacity for recognition than associative memory networks that perform as- 
sociative recall. For a wide range of conditions, Bogacz et al. showed that 
the maxi mum storage is proportion al to the number of synapses within the 



network ([Bogacz and Brown . 



20031 ). This is much larger than the capacity 



to recall, which is proportional to the square root of the nu mber of syri apses 



(i.e. the number of neurons in a fully connected network) (jAmiti . 



19891 ). In- 



tuitively this is easily understood; familiarity needs to store just a single 
bit (familiar versus non-familiar) per pattern, whereas to recall an event 
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requires the retrieval of the whole pattern (pattern completion) . 

In this paper we study how the dynamics of the network affects famil- 
iarity discrimination. We compare two different familiarity discriminators: 
familiarity based on Energy, FamE, w hich was previously introduced by Bo- 



gacz et al (jBogacz and Brown 



20031'). and a fam iliarity discriminator which 



is the time derivative of FamE ( 



Hopfield 



1982 ). From here on, we will call 



this latter discriminator the slope, and label it FamS. 

We show in our model how the signal for both familiarity discriminators 



(Rugg et al.. 


1998; 


Greve et al. 


2007) 



to investigate the robustness of familiarity detection, we study how it is af- 
fected by random fluctuations that are ubiquitousl y present in the nervous 



19891 ). this exter- 



system. As in models of attractor neural networks (jAmiti . 
nal source of noise is taken to be independent of the learned patterns and is 
controlled by a temperature parameter. 



Two familiarity discriminators 

We consider a network of N binary neurons, each with an activity ■Sj(t) = 
±1, the two states corresponding respectively to firing and not firing. The 
complete network activity is characterized by s(t). Any two neurons are 
connected by synaptic weights Wij. As standard in artificial network models, 
the network has a learning phase in which it encodes M stimuli x'' = {xf 
{p = 1 . . . M) in its weights using a Hebbian learning rule 

-j^ M 
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It can be shown that this learning rule is optimal in the limit of large A^, M 
(unpublished results). During the subsequent test phase, the network's per- 
formance is evaluated. At i = 0, the probe stimulus p (which is either a 
familiar or novel stimulus) is loaded into the network, s(t = 0) = x''. 

To define the network dynamics we assume that each neuron is up- 
dated precisely once, probabilistically and asynchronously, in each unit of 
time. (The exact duration that a time unit in the model corresponds to is 
hard to extract by comparing the model to, say, ERP data given the addi- 
tional delays present in the biology, but it should probably be on the order 
of 10. .100ms.) As standard in artificial neural networks, and in analogy 
with magnetic systems in physics, the random fluctuations are controlled 
by a temperature parameter T. These so-called Glauber dynamics have 
been extensively stud i ed in many different stochastic systems, for instance 



(Marro and Dickman 



19991 ). The probability distribution, after update, is 



given then by 

where /? = 1/T is the inverse temperature parameter, and hi{t) = J2jLi WijSj^t) 
is the total presynaptic current to the neuron i. Accordingly, for low tem- 
perature, the noise is small and there is a strong positive correlation between 
the input current hi and the output Sj, whilst for high temperature the out- 
put of a node is dominated by noise and is more or less independent of its 
input. 

The energy in the network at time t is defined as 

E{t) = -Y,WijSi{t)sj{t). (3) 
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As was previously reported in (iBogacz and Brownl . 120031 ) . the energy E[t = 
0) is able to effectively discriminate between old and novel stimuli. As we 
explain later, this energy is of order — (A^ + M) for learned stimuli, but of 
order — M for novel stimuli. Consequently, the energy or familiarity for old 
and novel stimuli are macroscopically different (they differ by order A^, while 
the std.dev.= \/2M) and the difference can thus be used as a familiarity 
discriminator. We call this discriminator FamE. 

However, the use of the energy is only one possible approach to model 
familiarity discrimination. The time derivative, or slope, of the energy 
S = ^^1^ can also be used as a familiarity discriminator. It indicates how 
quickly the network's energy changes, when either a novel or old stimulus 
is presented. Interestingly, this familiari ty measure was originally proposed 



1982), but to the best of 



by Hopfield in his seminal 1982 paper (iHopfieldl . 
our knowledge it has never received further exploration. We call this dis- 
criminator FamS. 

For convenience, we shall express the energy and the slope as functions 
of the M-dimensional vector m(t) = {'m'' {t)}pLi, the overlaps between the 
current network activity and each of the stored patterns. The components 
of this overlap vector are defined by 



1 



N 



m' 



(4) 



i=l 



Assuming the Hebbian learning rule ([T]), the energy ^ in terms of the 
overlaps is given by 

M 

~ " (5) 



Eit) = -Nj2[m%t)f, 
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whilst the slope (first derivative of the energy) is given by 



M 



dmP{t) 
di 



S{t) = -2NY,mP{t) 



(6) 



p=i 



and is thus proportional to the time derivative dmP{t)/dt of the overlaps. 

Dynamics of familiarity discrimination 

To mathematically address the network dynamics we assume the mean field 
approximation, i.e. Si ~ (sj). Under this approximation one obtains from 
equation (l2|), the dynamical equations for the overlaps (jH): 



The mean field formulation provides an accurate description of the dynamics 
of the system provided the temperature is not too high (see below). 

Knowing the dynamics, we focus on the time evolution of the two dis- 
criminators, energy and slope, defined in the previous section. To measure 
the temporal persistence, Figure [1] illustrates the time evolution of FamE 
and FamS when tested with novel or old stimuli. We compare the time 
evolution by simulations with Glauber dynamics given by equation ([2]), and 
by using the mean field dynamical equations ([7]). 



Figure [T] A and B, shows how the energy associated with old stimuli is 
much lower than for new stimuli. However, after a short transient of 4-5 
units of time, both signals become similar to each other, i.e. familiarity dis- 
crimination based on energy deteriorates rapidly post stimulus presentation. 



dmP{t) 
Jt 



N M 



"^"W + ^E^tanhi/?^ 

1=1 v = l 



xrm^(t)]. 



(7) 



(FIGURE 1 HERE) 
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Like the energy, the slope also shows a transient signal when the network 
is presented with a novel vs old stimulus, figure [T] graphs C and D. For low 
temperature, the slope for old stimuli is practically zero. This can be easily 
interpreted. An old stimulus corresponds to one of the local minima (attrac- 
tors) of the energy landscape. Because the temperature is low, and therefore 
the system is not receiving any external perturbation, the energy does not 
change, and its time derivative is practically zero. Similar to the energy, the 
slopes associated with old and new stimuli show significant differences im- 
mediately after stimulus presentation, but this difference diminishes shortly 
thereafter. 

Summarizing, both discriminators can distinguish old from new stimuli 
immediately after stimulus presentation, but after a very short transient 
(of the order of five time units), the discrimnation ability disappears. The 
slope tends to zero as time progresses because the network evolves towards 
a fixed point and becomes stationary (i.e. S" ~ 0). Though measures to 
discriminate spurious from no n-spurious attractor states have been proposed 



(Robins and McCallum 



2004 ). such measures do not directly translate into 



a discrimination between old and novel stimuli. 



Robustness of the familiarity discriminators 

To examine the performance of the two familiarity discriminators intro- 
duced in the previous section, we quantify the discriminability between the 
network responses to either new or old stimuli by the signal-to-noise ratio 
(SNR). Assuming two Gaussian probability distributions, M[fj,newi ^ncw] 
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■^[/^oid) fold]' associated with new and old stimuli, we define 



SNR 



(8) 



2 new 2 old 



To check that the distributions are indeed Gaussian, we repeated the sim- 
ulation of Fig. [T] 100 times and computed the probability distributions. 
For both FamE and FamS the 4th moments of their distribution satisfied 
(x^) = / P{x)x^dx = + 6/.i^(7^ + 3(7^0"^, with a relative error smaller than 
5%, (where fi = (x) denotes the mean and o"^ = (x^) — (x)"^ the variance), 
indicating that the distributions are well approximated by Gaussians. 

We address here how random fluctuations in neural activity (independent 
of the learned patterns) affect the performance of the familiarity discrimina- 
tors. We study the effect of temperature at two different time points, t = 
and t = 1. As stated above, time is defined such that in one unit, all neurons 
are asynchronously updated once. The choice of t = 1 is not special; we just 
study the network properties at this time to gain understanding as to how 
the network evolves. 

The results are illustrated in flgureEJ Immediately after stimulus (t = 0), 
we observe that FamE is independent of the temperature value (flgure[2jA), 
whilst FamS has a non-linear dependence on the temperature (flgure[2jC). 
For high temperature, FamS performs better as a familiarity discriminator. 
This flnding can be intuitively understood. The energy and its time deriva- 
tive can be separated into signal and noise contributions. The signal for the 
slope is proportional to the rate of change of the energy, and therefore pro- 
portional to the rate of change of the overlap between the network activity 
and the stimulus. At low temperatures, the signal associated with an old 
stimulus is very low as the overlap with the stimulus is almost invariant. 
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Contrarily, at higher temperature, the overlap with old stimuli changes very 
quickly; it decays from 1 to 0, and consequently the slope-signal relation- 
ship increases considerably (the higher temperature, the higher signal for 
FamS). The noise component for the slope, although dependent on T, is 
similar for both old and novel stimuli. As a result the main temperature 
dependence stems from the signal term. In figure O the case of T > 1 is 
not explicitely studied because in this region the network can not retrieved 
any of the learned patterns, i.e. the only stable solution is m = 0, what is 



19891 ) 



so-called paramagnetic or non-memory solution (jAmit 
(FIGURE 2 HERE) 



In contrast to time t = 0, at time t = 1, post stimulus presentation, both 
discriminators FamE and FamS show a similar breakdown in discrimination 
for increased temperature (figure [2jF). In the next section, we analytically 
study the maximum storage capacity for both FamE and FamS at time t = 0. 
The results are in agreement with the simulations. For t = 1, the mean field 
predictions, however, do not reproduce the network simulations. To study 
such situations (which we do not explicitly deal with here), one wo uld need 



to us e other techniques, for example, generating functional analysis (jCoolenl . 
200l|). 



Maximum storage capacity 

When the number of stimuli encoded in the weights increases, the SNR 
decreases. One can define the storage capacity (or maximum number of 
stimuli encoded in the learning rule and successfully discriminated) as the 
point where the SNR drops below one. This gives the maximum number of 
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stimuli Mmax that can be encoded in the network. In this section we present 
exphcit calculations for both discriminators FamE and FamS for time t = 0. 



Storage capacity of FamE 



Let p = p label an o ld stimulus 



in these calculations (Herzetal 



] presen ted to the network. As is common 
199ll ). we separate the sum appearing in 



equation ^ into a signal {p = p) plus noise contributions. The latter is 
determined by interference from previously stored stimuli (p 7^ p). From 
equation ([5]) it follows that the energy associated with old stimuli is dis- 
tributed as 

i^oid (t) = -N[mP - N Y.[mP{t)f. (9) 

The first term on the right hand side is the signal and the second one the 
noise contribution. At t = 0, we obtain mP{t = 0) = 1 because the pattern 
p was an old stimulus. As for large N the central limit theorem applies, the 
overlaps with the other patt erns p 7^ p have a Gaussian distribution with 



average and variance (lAmit et al 



19871 ). Accordingly, we can easily 



compute the expected value and the variance for the energy. Using that the 
sum of two Gaussian distributed variables is again a Gaussian distribution, 



Eoid{t = 0) G M[-{N + M), 2M]. 



(10) 



Analogously, the energy for novel stimuli is distributed as 



^new(t = 0) G AA[-M,2M]. 



(11) 



From equation ([8]) we obtain SNR = y/N'^/{2M), in agreement with the 
simulations (see figure [2jE). Equivalently, the maximum storage capacity. 
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(the M for which SNR = 1), is given by 



Ar2 

M„,ax[FamE,t = 0] = — , (12) 



and thus the storage is of order A^ ^, which has been reporte d in previous 



computational models using FamE (jBogacz and Brownl . 



2003). 



Storage capacity of FamS 



Following the same strategy applied to FamE to FamS, we are able to sep- 
arate signal {p = p) and noise {p ^ p) terms for the slope. At the instant of 
the stimulus presentation (t = 0), we substitute equation d?]) in equation ([6|). 
Next, we apply the central limit theorem, which is a good approximation 
for large A^. It ensures that the sum over the different sites i of the noise 
contribution Ylip^p xImP appearing inside the tanh function, is equivalent to 
the average over a Gaussian noise with mean and variance a = M/N , the 
network load. Using these considerations, it is straightforward to obtain 

Soid{t = 0) G [2iV (1 - /i - h) + 2M, 8M] , (13) 



and for novel stimuli 



5new(t = 0) e [-2A^/3 + 2M, 8M] . (14) 



The integrals /i, I2 and I3 appearing in equations (|13p and (jl4p are 

_dz 
s/2'j 

s/2'j 

— exp (— 2^/2) tanh {fi^foLz) ^foLZ., 



/dz 
exp i-z^ ji) tanh (/3 + f5^faz) , 
V 27r ^ ' 

hia,P) = /" exp (-2^/2) tanh (/J + /3Va-z) (15) 
J \/2tt ^ ' 
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where (3 = 1/T is the inverse temperature. Prom equations ()13p and (I14p 



it follows that SNR = ^/N'^/{2M)[l - Ii{a, (3) - hia, (3) + h{a, /3)], which 
can be computed numerically. The results are represented in figure[2JE. The 
expected values used in the signal-to-noise ratio calculation (figure ElC) fits 
well with the simulations. However, the theoretical predictions for the vari- 
ance of both FamS(old) and FamS(new) equals SM, independent of tem- 
perature, which is in disagreement with the simulations. Therefore, the 
signal-to-noise ratio calculation disagrees with simulations for high temper- 
atures (figure ElE). See the appendix for a more detailed calculation of how 
the mean field prediction is affected by high temperatures. 

(FIGURE 3 HERE) 

The maximum storage for FamS is again obtained by solving SNR= 1, 
which yields 

Mrnax[FamS,t = 0] = — (1-Ii(amax, /3) -/2(amax, /9)+/3(amax, (16) 

Because the integrals /i(a,/3), l2{a,(3) and 13(0, P) depend on M, this ex- 
pression does not give us Mmax explicitly. The dependence on N is more 
complic ated than for other comp utational models of familiarity discrimi- 



nation (jBogacz and Brown 



20031 ) ■ (and in particular for FamE above), for 
which the maximum storage capacity is directly proportional to N'^. Inter- 
estingly, Mmax for FamS at t = is dependent on the temperature, whilst 
FamE is completely independent of temperature (recall figure [51 graphs A 
and C). 

In the two limits T = and T — > 00 we can perform the integrals in 
equation ()16p to obtain Mmax explicitly. For T = 0, the integrals (llSh can 
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be computed using 



/3 

1 

/3 



exp i-z^ li\ tanh [Q \az + 61 ) = erf ( 
lim /-^expf-z^/2)tanh(/3[az + 6])2; = exp ( ,{17) 



giving lim/3^oo/i(a,/3) = erf lim/s^oo hia, (3) = \/^exp(-^) 

and lim^^oo ^3(05 /9) = y^^- Here, erf(x) is the error function erf (x) = 
Jq exp (— u^) du. Therefore at T = equation becomes 



M„,ax = — I 1 - erf I W:^^^ | + 



1 — exp 



N 



2M„ 



2 1 \\j 2Mmaxy V 

'(18) 

In figure El we plot, as a function of A^, the ratio of the initial zero temper- 
ature storage for FamS and FamE. We see that although FamS performs 
slightly worse than FamE, both storage capacities grow proportional to N'^. 
By way of example, for = 1000, we see Mmax[FamS,t = 0,T = 0] 
96%Minax[FamE, t = 0], i.e. the capacities are almost identical. 

In the other limit that T ^ 00, random fluctuations in neural activity 
dominate the network dynamics. All the integrals of (|15|) are zero, and 
hence Mmax[FamS,t = 0] ~ Minax[FamE,i = 0]. That is, in this limit, the 
theoretical maximum storage is the same for both FamS and FamE, and is 
independent of T. 



Discussion 

Familiarity describes a retrieval process that supports recognition memory 
by providing a feeling that something has been encountered before. Nu- 
merous empirical studies have investigated familiarity processes in humans 
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( Yonelinas 



20021 ) and non-humans (iBrown and Xiane 



19981 ). Recently, 



some neuronal networks mode^ 



proposed (jBogacz and Brown 



ing fa miliarity discrimination have also been 



20031 ) ■ However, no computational work has 
addressed the dynamics of familiarity discrimination, which is relevant when 
comparing these models to experiments. Furthermore, we have studied how 
noise affects the familiarity performance. 



We have compared the energy discriminator (FamE) used by (jBogacz and Brownl . 



20031 ) to its time derivative, the slope (FamS). Interestingly, the FamS dis- 



crimi nator was already suggested by Hopfield in his seminal work (jHopfield 



19821 ). An interesting consequence is that the original Hopfield model can 
be used to model both recollection (stationary properties of the retrieval dy- 
namics) and familiarity (transient dynamics after the stimulus presentation). 
The slope discriminator (FamS) is affected by the temporal dependency of 
the energy discriminator (FamE). In other words, the slope discriminator 
captures the fact that the speed of discrimination is predictive for the dis- 
crimination outcome per se. 

For both discriminators the familiarity signals decay quickly after stim- 
ulus presentation and are detectable only for a short period of time. This 
can be compared to the speed of recollection. Assuming that recollection 
memories correspond to attractors in the Hopfield model, recollection in- 
formation only becomes available once the attractor state is reached. By 
that time, the slope is zero, and the energy difference is very small. Thus 
the experimentally observed timing difference of familiarity and recollection 
follows naturally from our model. 

The storage capacity of these familiarity discriminators is much larger 
(proportional to N'^) compared to recollection (proportional to N), we demon- 
strated that this capacity is dependent on the temperature. We have pre- 
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sented a detailed derivation of the maximum storage immediately after stim- 
ulus presentation (t = 0). We have shown that for low temperature, the stor- 
age capacity related to FamS is lower than that for FamE, but still scales 
with the number of synapses, e.g. for = 1000, the slope gives a storage 
capacity 96% as good as the energy. For high temperatures the difference 
between the storage capacities of FamS and FamE is negligible (the storage 
capacity for both is the approximately, N'^/2). 

Interestingly, this means that the performance of FamS improves as one 
goes to the high temperature regime, a fact which is a priori counterintuitive. 



especia . 



(Amit 



ly gi ven how the temperature affects recollection in Hopfield nets 



1989 ). i.e. the higher the temperature, the worse the recollection 
performance. However, after some time steps, our simulations (figure ElF) 
show that, for both FamE and FamS, high noise levels produce a stochastic 
disruption of the discrimination, decreasing the SNR and the performance 
of familiarity, a concurrence with the dynamics of recollection. 
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Appendix: Mean field validity dependence on temperature 

To compute the slope in equation ([6]), we need an analytic expression for 
dm^/dt, or equivalently, given the definition (jl]), we have to compute the 
derivative dsi/dt, which is governed by the Glauber dynamics given by ([2]), 
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see (jMarro and Dickmad . 119991 ) for more detailed situations. Given Si{t), 
the Glauber dynamics give an uncertainty in Si{t + 1), such that 



Var[si(t + l)\{sj{t)}] = sech\f3hi{t)) , (19) 

which implies 

ds- 

= tanh(/3/ii) -Si + 0{sech{(3hi)) . (20) 
We use this result to find the error induced in our Ccilciilation of Syicw 

When 

a new pattern is presented, the m'' are all of order A^~^/^. This implies that 



the local fields, defined as hi = J2p xlmP , are of order ^/ci = \JM/N . Hence, 
by equations (jH) and ()20p . the error in our calculation of dm'^/dt is given by 

Error (^^) = O (^sech (/J^^)) , (21) 

for each p. Finally, by ©, we conclude that 



Error (5new) = O \ y/U sech ( 1./^ 1 1 . (22) 




TV N 



Since sech(x) decays exponentially with large x, but is of order 1 for small 
X, the error in our calculation of Snew, coming from the mean field approxi- 



mation, is only going to be negligible in the limit in which {1/T)^/M/N is 
large. This explains why there is a growing discrepancy between theory and 
simulation as the temperature T is increased (see figure [2jE). 
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Figure 1: Temporal persistence of discrimination by familiarity. For 

different values of the temperature parameter, T = 0.20 on the left and 
T = 0.60 on the right, we simulate a network of = 1000 neurons and 
M = 50 uncorrelated patterns. Both FamE and FamS can discriminate 
between novel and old stimuli during a short period post stimulus presenta- 
tion. After this, the slope begins to tend to zero, indicating that the activity 
has converged to one of the stored stimuli. This is due to the well-known 
pattern completion dynamics that occurs in attractor neural networks. One 
unit of time is defined as the time taken to update the whole population of 
neurons in the network. 
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Figure 2: Robustness of discrimination by familiarity. Immediately 
after stimulus presentation, left graphs, FamE is independent of temper- 
ature, whereas FamS is enhanced if the temperature parameter increases. 
After one timestep, right graphs, both FamE and FamS deteriorate for high 
values of temperature. We represent the values of the energy (top graph) 
and the slope (middle) with the standard deviation. On the bottom, each 
point in the curves corresponds with a fixed value of temperature, in which 
we compute the SNR concerning the probability distributions of the net- 
work responses towards both familiar and novel stimuli. These simulations 
correspond with averaging over 100 runs of a network with N = 1000 neu- 
rons and M = 50 uncorr elated patterns. Black solid lines are the theoretical 
predictions (see text for details). 




Figure 3: Ratio of initial storage capacities at zero temperature. 

The storage of discriminator FamS is obtained by numerical solution of 
equation ()18p as a function of the number of neurons N. This is normalized 
by the storage for FamE p2|) . to obtain a ratio of the performances of the 
two discriminators. 
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